18 research outputs found

    Automatic Classification of African Elephant (\u3cem\u3eLoxodonta africana\u3c/em\u3e) Follicular and Luteal Rumbles

    Get PDF
    Recent research in African elephant vocalizations has shown that there is evidence for acoustic differences in the rumbles of females based on the phase of their estrous cycle (1). One reason for these differences might be to attract a male for reproductive purposes. Since rumbles have a fundamental frequency near 10Hz, they attenuate slowly and can be heard over a distance of several kilometers. This research exploits differences in the rumbles to create an automatic classification system that can determine whether a female rumble was made during the luteal or follicular phase of the ovulatory cycle. This system could be used as the basis for a non-invasive technique to determine the reproductive status of a female African elephant. The classification system is based on current state-of-the-art human speech processing systems. Standard features and models are applied with the necessary modifications to account for the physiological, anatomical and language differences between humans and African elephants. The long-term goal of this research is to develop a universal analysis framework and robust feature set for animal vocalizations that can be applied to many species. This research represents an application of this framework. The vocalizations used for this study were collected from a group of three female captive elephants. The elephants are fitted with radio-transmitting microphone collars and released into one of three naturalistic yards on a daily basis. Although this data collection setup is good for determining the speaker of each vocalization, it suffers from many potential noise sources such as RF interference, passing vehicles, and the flapping of the elephant’s ears against the collar

    Generalized Perceptual Linear Prediction (gPLP) Features for Animal Vocalization Analysis

    Get PDF
    A new feature extraction model, generalized perceptual linear prediction (gPLP), is developed to calculate a set of perceptually relevant features for digital signal analysis of animalvocalizations. The gPLP model is a generalized adaptation of the perceptual linear prediction model, popular in human speech processing, which incorporates perceptual information such as frequency warping and equal loudness normalization into the feature extraction process. Since such perceptual information is available for a number of animal species, this new approach integrates that information into a generalized model to extract perceptually relevant features for a particular species. To illustrate, qualitative and quantitative comparisons are made between the species-specific model, generalized perceptual linear prediction (gPLP), and the original PLP model using a set of vocalizations collected from captive African elephants (Loxodonta africana) and wild beluga whales (Delphinapterus leucas). The models that incorporate perceptional information outperform the original human-based models in both visualization and classification tasks

    Automatic Classification and Speaker Identification of African Elephant (\u3cem\u3eLoxodonta africana\u3c/em\u3e) Vocalizations

    Get PDF
    A hidden Markov model (HMM) system is presented for automatically classifying African elephant vocalizations. The development of the system is motivated by successful models from human speech analysis and recognition. Classification features include frequency-shifted Mel-frequency cepstral coefficients (MFCCs) and log energy, spectrally motivated features which are commonly used in human speech processing. Experiments, including vocalization type classification and speaker identification, are performed on vocalizations collected from captive elephants in a naturalistic environment. The system classified vocalizations with accuracies of 94.3% and 82.5% for type classification and speaker identification classification experiments, respectively. Classification accuracy, statistical significance tests on the model parameters, and qualitative analysis support the effectiveness and robustness of this approach for vocalization analysis in nonhuman species

    Application of Speech Recognition to African Elephant (Loxodonta Africana) Vocalizations

    Get PDF
    This paper presents a novel application of speech processing research, classification of African elephant vocalizations. Speaker identification and call classification experiments are performed on data collected from captive African elephants in a naturalistic environment. The features used for classification are 12 mel-frequency cepstral coefficients plus log energy computed using a shifted filter bank to emphasize the infrasound range of the frequency spectrum used by African elephants. Initial classification accuracies of 83.8% for call classification and 88.1% for speaker identification were obtained. The long-term goal of this research is to develop a universal analysis framework and robust feature set for animal vocalizations that can be applied to many species

    A Framework for Bioacoustic Vocalization Analysis Using Hidden Markov Models

    Get PDF
    Using Hidden Markov Models (HMMs) as a recognition framework for automatic classification of animal vocalizations has a number of benefits, including the ability to handle duration variability through nonlinear time alignment, the ability to incorporate complex language or recognition constraints, and easy extendibility to continuous recognition and detection domains. In this work, we apply HMMs to several different species and bioacoustic tasks using generalized spectral features that can be easily adjusted across species and HMM network topologies suited to each task. This experimental work includes a simple call type classification task using one HMM per vocalization for repertoire analysis of Asian elephants, a language-constrained song recognition task using syllable models as base units for ortolan bunting vocalizations, and a stress stimulus differentiation task in poultry vocalizations using a non-sequential model via a one-state HMM with Gaussian mixtures. Results show strong performance across all tasks and illustrate the flexibility of the HMM framework for a variety of species, vocalization types, and analysis tasks

    A Framework for Bioacoustic Vocalization Analysis Using Hidden Markov Models

    Get PDF
    Using Hidden Markov Models (HMMs) as a recognition framework for automatic classification of animal vocalizations has a number of benefits, including the ability to handle duration variability through nonlinear time alignment, the ability to incorporate complex language or recognition constraints, and easy extendibility to continuous recognition and detection domains. In this work, we apply HMMs to several different species and bioacoustic tasks using generalized spectral features that can be easily adjusted across species and HMM network topologies suited to each task. This experimental work includes a simple call type classification task using one HMM per vocalization for repertoire analysis of Asian elephants, a language-constrained song recognition task using syllable models as base units for ortolan bunting vocalizations, and a stress stimulus differentiation task in poultry vocalizations using a non-sequential model via a one-state HMM with Gaussian mixtures. Results show strong performance across all tasks and illustrate the flexibility of the HMM framework for a variety of species, vocalization types, and analysis tasks

    An analog approach for weather estimation using climate projections and reanalysis data

    Get PDF
    General circulation models (GCMs) are essential for projecting future climate; however, despite the rapid advances in their ability to simulate the climate system at increasing spatial resolution, GCMs cannot capture the local and regional weather dynamics necessary for climate impacts assessments. Temperature and precipitation, for which dense observational records are available, can be bias corrected and downscaled, but many climate impacts models require a larger set of variables such as relative humidity, cloud cover, wind speed and direction, and solar radiation. To address this need, we develop and demonstrate an analog-based approach, which we call a ‘‘weather estimator.’’ The weather estimator employs a highly generalizable structure, utilizing temperature and precipitation from previously downscaled GCMs to select analogs from a reanalysis product, resulting in a complete daily gridded dataset. The resulting dataset, constructed from the selected analogs, contains weather variables needed for impacts modeling that are physically, spatially, and temporally consistent. This approach relies on the weather variables’ correlation with temperature and precipitation, and our correlation analysis indicates that the weather estimator should best estimate evaporation, relative humidity, and cloud cover and do less well in estimating pressure and wind speed and direction. In addition, while the weather estimator has several user-defined parameters, a sensitivity analysis shows that the method is robust to small variations in important model parameters. The weather estimator recreates the historical distributions of relative humidity, pressure, evaporation, shortwave radiation, cloud cover, and wind speed well and outperforms a multiple linear regression estimator across all predictands

    Coupled impacts of climate and land use change across a river-lake continuum: Insights from an integrated assessment model of Lake Champlain\u27s Missisquoi Basin, 2000-2040

    Get PDF
    Global climate change (GCC) is projected to bring higher-intensity precipitation and higher-variability temperature regimes to the Northeastern United States. The interactive effects of GCC with anthropogenic land use and land cover changes (LULCCs) are unknown for watershed level hydrological dynamics and nutrient fluxes to freshwater lakes. Increased nutrient fluxes can promote harmful algal blooms, also exacerbated by warmer water temperatures due to GCC. To address the complex interactions of climate, land and humans, we developed a cascading integrated assessment model to test the impacts of GCC and LULCC on the hydrological regime, water temperature, water quality, bloom duration and severity through 2040 in transnational Lake Champlain\u27s Missisquoi Bay. Temperature and precipitation inputs were statistically downscaled from four global circulation models (GCMs) for three Representative Concentration Pathways. An agent-based model was used to generate four LULCC scenarios. Combined climate and LULCC scenarios drove a distributed hydrological model to estimate river discharge and nutrient input to the lake. Lake nutrient dynamics were simulated with a 3D hydrodynamic-biogeochemical model. We find accelerated GCC could drastically limit land management options to maintain water quality, but the nature and severity of this impact varies dramatically by GCM and GCC scenario

    Generalized Perceptual Features for Vocalization Analysis across Multiple Species

    No full text
    This paper introduces the Greenwood function cepstral coefficient (GFCC) and generalized perceptual linear prediction (GPLP) feature extraction models for the analysis of animal vocalizations across arbitrary species. These features are generalizations of the well-known mel-frequency cepstral coefficient (MFCC) and perceptual linear prediction (PLP) approaches, tailored to take optimal advantage of available knowledge of each species\u27 auditory frequency range and/or audiogram data. Illustrative results are presented comparing use of the GFCC and GPLP features versus MFCC features over the same frequency range

    Automatic Classification of Animal Vocalizations

    No full text
    Bioacoustics, the study of animal vocalizations, has begun to use increasingly sophisticated analysis techniques in recent years. Some common tasks in bioacoustics are repertoire determination, call detection, individual identification, stress detection, and behavior correlation. Each research study, however, uses a wide variety of different measured variables, called features, and classification systems to accomplish these tasks. The well-established field of human speech processing has developed a number of different techniques to perform many of the aforementioned bioacoustics tasks. Melfrequency cepstral coefficients (MFCCs) and perceptual linear prediction (PLP) coefficients are two popular feature sets. The hidden Markov model (HMM), a statistical model similar to a finite autonoma machine, is the most commonly used supervised classification model and is capable of modeling both temporal and spectral variations. This research designs a framework that applies models from human speech processing for bioacoustic analysis tasks. The development of the generalized perceptual linear prediction (gPLP) feature extraction model is one of the more important novel contributions of the framework. Perceptual information from the species under study can be incorporated into the gPLP feature extraction model to represent the vocalizations as the animals might perceive them. By including this perceptual information and modifying parameters of the HMM classification system, this framework can be applied to a wide range of species. The effectiveness of the framework is shown by analyzing African elephant and beluga whale vocalizations. The features extracted from the African elephant data are used as input to a supervised classification system and compared to results from traditional statistical tests. The gPLP features extracted from the beluga whale data are used in an unsupervised classification system and the results are compared to labels assigned by experts. The development of a framework from which to build animal vocalization classifiers will provide bioacoustics researchers with a consistent platform to analyze and classify vocalizations. A common framework will also allow studies to compare results across species and institutions. In addition, the use of automated classification techniques can speed analysis and uncover behavioral correlations not readily apparent using traditional techniques
    corecore